Performance Analysis of Privacy Preserving Naïve Bayes Classifiers for Distributed Databases
Authors
Abstract
Secure and fast distributed classification is an important problem. This paper focuses on privacy-preserving distributed classification rule mining and analyzes the performance of privacy-preserving Naïve Bayes classifiers for horizontally and vertically partitioned databases. The Naïve Bayes classifier is a simple but efficient baseline classifier. We compare the performance of our two proposed privacy-preserving Naïve Bayes protocols with the basic Naïve Bayes classifier (NBC). The first protocol uses an Un-trusted Third Party (UTP) for horizontally partitioned data; the second uses a secure multiplication protocol for vertically partitioned data. The results show that our protocols' execution time is lower than that of the existing NBC, because each party computes its own probabilities (model parameters) locally as intermediate results and transfers only these intermediate results for further calculation. Accuracy on the test data is the same because the model parameters calculated from the training data are the same. Our protocols are secure, fast, and easy to follow and understand.
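As a rough illustration of why the accuracy is unchanged for horizontally partitioned data, the sketch below has each party compute only its local counts and then sums them. It is not the paper's protocol: the UTP and any masking or encryption of the exchanged values are deliberately omitted, and the function names and toy data are hypothetical.

```python
from collections import Counter

def local_counts(rows):
    """Count classes and (attribute index, value, class) triples for one party's rows."""
    class_counts = Counter()
    feature_counts = Counter()
    for *features, label in rows:
        class_counts[label] += 1
        for i, value in enumerate(features):
            feature_counts[(i, value, label)] += 1
    return class_counts, feature_counts

def merge_counts(parts):
    """Sum the intermediate counts sent by all parties (no privacy layer here)."""
    total_class, total_feature = Counter(), Counter()
    for class_counts, feature_counts in parts:
        total_class.update(class_counts)      # Counter.update adds counts
        total_feature.update(feature_counts)
    return total_class, total_feature

def conditional_probability(counts, i, value, label):
    """Estimate P(attribute i = value | class = label) from aggregated counts."""
    class_counts, feature_counts = counts
    return feature_counts[(i, value, label)] / class_counts[label]

# Hypothetical toy rows: (outlook, windy, play). Each party holds different rows
# over the same attributes, i.e. a horizontal partition.
party_a = [("sunny", "no", "yes"), ("rainy", "yes", "no")]
party_b = [("sunny", "yes", "yes"), ("rainy", "no", "yes"), ("sunny", "yes", "no")]

merged = merge_counts([local_counts(party_a), local_counts(party_b)])
centralized = local_counts(party_a + party_b)
```

Because the merged counts equal the counts computed on the pooled rows, any probability derived from them, and therefore any prediction, matches the centralized classifier.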
Similar resources
Privacy Preserving Naïve Bayes Classifier for Horizontally Distribution Scenario Using Un-trusted Third Party
The aim of the classification task is to discover some kind of relationship between the input attributes and the output class, so that the discovered knowledge can be used to predict the class of a new unknown tuple. The problem of secure distributed classification is an important one. In many situations, data is split between multiple organizations. These organizations may want to utilize all ...
Privacy Preserving Naive Bayes Classifier for Horizontally Partitioned Data
The problem of secure distributed classification is an important one. In many situations, data is split between multiple organizations. These organizations may want to utilize all of the data to create more accurate predictive models while revealing neither their training data / databases nor the instances to be classified. The Naive Bayes Classifier is a simple but efficient baseline classifie...
Privacy Preserving Naïve Bayes Classifier for Vertically Partitioned Data
Privacy-Preserving Data Mining – developing models without seeing the data – is receiving growing attention. This paper assumes a privacy-preserving distributed data mining scenario: data sources collaborate to develop a global model, but must not disclose their data to others. Naïve Bayes is often used as a baseline classifier, consistently providing reasonable classification performance. This...
Privacy-preserving naive Bayes classification on distributed data via semi-trusted mixers
Distributed data mining applications, such as those dealing with health care, finance, counter-terrorism and homeland defense, use sensitive data from distributed databases held by different parties. This comes into direct conflict with an individual’s need and right to privacy. It is thus of great importance to develop adequate security techniques. In this paper, we consider privacy-preserving ...